CDS
Accession Number | TCMCG075C18418 |
gbkey | CDS |
Protein Id | XP_017977573.1 |
Location | join(39222400..39222411,39222496..39222612,39222755..39222813,39222965..39223024,39223172..39223225,39223340..39223388,39223485..39223532,39223701..39223766,39223859..39223908,39224174..39224300,39224376..39224460,39224568..39224649,39225305..39225419,39225517..39225573,39225671..39225717,39225800..39225854,39226291..39226308) |
Gene | LOC18600652 |
GeneID | 18600652 |
Organism | Theobroma cacao |
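The Location field above uses the feature-table `join(...)` notation: each `start..end` pair is one coding segment, with 1-based inclusive coordinates. As a rough consistency check, the segment lengths can be summed and compared with the protein length reported further down. The sketch below is only illustrative; the helper names `parse_join` and `spliced_length` are hypothetical and not part of any database tooling, and it assumes a simple same-strand `join` with no `complement(...)` or partial-boundary markers.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "join(39222400..39222411,39222496..39222612,39222755..39222813,"
    "39222965..39223024,39223172..39223225,39223340..39223388,"
    "39223485..39223532,39223701..39223766,39223859..39223908,"
    "39224174..39224300,39224376..39224460,39224568..39224649,"
    "39225305..39225419,39225517..39225573,39225671..39225717,"
    "39225800..39225854,39226291..39226308)"
)


def parse_join(location):
    """Return (start, end) pairs (1-based, inclusive) found in a join(...) string.

    Hypothetical helper: handles only plain start..end segments.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Total nucleotide count once all segments are concatenated."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = spliced_length(segments)

# One codon is the stop codon, so it is subtracted from the amino-acid count.
print(len(segments), total_nt, total_nt // 3 - 1)  # prints: 17 1101 366
```

The 17 segments add up to 1101 nt, that is 367 codons including the stop codon, which agrees with the 366 aa length listed for the protein.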
Protein
Length | 366aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018122084.1 |
Definition | PREDICTED: flap endonuclease 1 isoform X5 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | L |
Description | Structure-specific nuclease with 5'-flap endonuclease and 5'-3' exonuclease activities involved in DNA replication and repair. During DNA replication, cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. It enters the flap from the 5'-end and then tracks to cleave the flap base, leaving a nick for ligation. Also involved in the long patch base excision repair (LP-BER) pathway, by cleaving within the apurinic/apyrimidinic (AP) site-terminated flap. Acts as a genome stabilization factor that prevents flaps from equilibrating into structures that lead to duplications and deletions. Also possesses 5'-3' exonuclease activity on nicked or gapped double-stranded DNA, and exhibits RNase H activity. Also involved in replication and repair of rDNA and in repairing mitochondrial DNA. |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | ko00000, ko00001, ko01000, ko03032, ko03400, ko04147 |
KEGG_ko | ko:K04799 |
EC | - |
KEGG_Pathway | ko03030, ko03410, ko03450, map03030, map03410, map03450 |
GOs | - |
Sequence
CDS: ATGGGCATCAAGGGTTTAACGAAGCTTCTAGCGGACAATGCACCCAAGGCCATGAAGGAACAGAAATTCGAGAGCTTTTTCGGCCGCAAGATCGCCATCGACGCCAGCATGAGCATTTACCAGTTTCTCATTGTGGTGGGTCGTAGTGGGACTGAAATGCTCACCAATGAAGCGGGTGAGGTCACCAGTCATCTGCAGGGCATGTTTACTCGTACAATTCGGCTTCTCGAAGCTGGGATCAAACCTGTCTATGTTTTTGACGGTCAGCCTCCTGATTTGAAGAAACAAGAGCTTGCAAAACGTTACTCAAAGAGGGCAGATGCTACTGAGGATTTGCAACAAGCCATGGAGGCTGGCAATAAGGAGGACATTGAAAAATTCAGCAAGCGGACAGTAAAGGTGACAAAGCAGCACAATGAAGACTGTAAACGGCTTTTAAGACTTATGGGGGTACCTGTGATCGAGGCTTCTTCTGAAGCAGAGGCGCAATGTGCTGCACTTTGCAAATCAGGAAAGTTTCAGGTTTATGCTGTGGCTTCTGAGGATATGGATTCTTTAACCTTTGGAGCTCCTAGATTTCTTCGCCATTTAATGGACCCTAGCTCAAGAAAAGTTCCGGTCATGGAGTTTGAAGTTGCAAAGGTTTTGGAGGAGCTGAATCTTACCATGGATCAATTCATTGACTTGTGCATTCTTTCTGGCTGTGATTATTGTGACAGCATTCGAGGTATTGGGGGACAGACAGCTTTGAAGTTAATTCGTCAACATGGGTCTATAGAGCATATTCTTCAGAACATAAACAAAGAGAGGTACTCAATACCTGATGATTGGCCATATCAAGAGGCTCGACAGCTTTTTCAAGAACCATTAGTCTGCACTGATGATGAGCAACTTGAGATGAAGTGGAATGCTCCAGATGACGAAGGGTTGATAACCTTTCTGGTGAATGAAAATGGGTTCAACGGTGACAGAGTGACAAAGGCAATAGAAAAAATTAAAGCAGCCAAGAACAAGTCATCGCAGGGCCGATTAGAGTCATTTTTTAAGCCAGTTGGTAACACATCTATACCAATTAAACGGAAGGCTTATTGGCTGCCATAA |
Protein: MGIKGLTKLLADNAPKAMKEQKFESFFGRKIAIDASMSIYQFLIVVGRSGTEMLTNEAGEVTSHLQGMFTRTIRLLEAGIKPVYVFDGQPPDLKKQELAKRYSKRADATEDLQQAMEAGNKEDIEKFSKRTVKVTKQHNEDCKRLLRLMGVPVIEASSEAEAQCAALCKSGKFQVYAVASEDMDSLTFGAPRFLRHLMDPSSRKVPVMEFEVAKVLEELNLTMDQFIDLCILSGCDYCDSIRGIGGQTALKLIRQHGSIEHILQNINKERYSIPDDWPYQEARQLFQEPLVCTDDEQLEMKWNAPDDEGLITFLVNENGFNGDRVTKAIEKIKAAKNKSSQGRLESFFKPVGNTSIPIKRKAYWLP |
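The CDS and Protein entries above are related by codon-by-codon translation. A minimal sketch of that step follows, assuming the standard genetic code (NCBI translation table 1), which is an assumption here since the record does not state a table. The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last 12 nucleotides of the CDS are used as examples; substituting the full CDS string is left to the reader.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Hypothetical helper: expects uppercase A/C/G/T and ignores any
    trailing bases that do not form a complete codon.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 12 nt of the CDS above (the first coding segment).
print(translate("ATGGGCATCAAG"))  # prints: MGIK
# Last 12 nt of the CDS above, ending in the TAA stop codon.
print(translate("TGGCTGCCATAA"))  # prints: WLP
```

Both fragments match the start (`MGIK...`) and end (`...WLP`) of the listed protein sequence.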